Predicting Patch Correctness Based on the Similarity of Failing Test Cases

Authors

Abstract

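How do we know a generated patch is correct? This is a key challenging question that automated program repair (APR) systems struggle to address given the incompleteness of available test suites. Our intuition is that we can triage correct patches by checking whether each generated patch implements code changes (i.e., behavior) that are relevant to the bug it addresses. Such a bug is commonly specified by a failing test case. Towards predicting patch correctness in APR, we propose a novel yet simple hypothesis on how the link between the patch behavior and failing test specifications can be drawn: similar failing test cases should require similar patches. We then propose BATS, an unsupervised learning-based approach to predict patch correctness by checking patch Behavior Against failing Test Specification. BATS exploits deep representation learning models for code and patches: for a given failing test case, the yielded embedding is used to compute similarity metrics in the search for historical similar test cases, in order to identify the associated applied patches, which are then used as a proxy for assessing the correctness of the APR-generated patches. Experimentally, we first validate our hypothesis by assessing whether ground-truth developer patches cluster together in the same way that their associated failing test cases are clustered. Then, after collecting a large dataset of 1,278 plausible patches (written by developers or generated by 32 APR tools), we use BATS to predict correct patches: BATS achieves an AUC between 0.557 and 0.718 and a recall between 0.562 and 0.854 in identifying correct patches. Our approach outperforms state-of-the-art techniques for identifying correct patches without the need for large labeled patch datasets, as is the case with machine learning-based approaches. While BATS is constrained by the availability of similar test cases, we show that it can still be complementary to existing approaches: when combined with a recent approach that relies on supervised learning, BATS improves the overall recall in detecting correct patches. We finally show that BATS is complementary to the state-of-the-art PATCH-SIM dynamic approach for identifying correct patches generated by APR tools.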
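The retrieval idea in the abstract (find historical failing test cases similar to the current one, then compare the candidate patch against the patches that fixed them) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the two thresholds, and the toy two-dimensional vectors are all assumptions made for illustration, and real embeddings would come from learned representation models rather than hand-written lists.

```python
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def score_patch(test_vec, patch_vec, history, test_threshold=0.8):
    """Score a candidate patch against patches of similar historical failing tests.

    history is a list of (test_embedding, patch_embedding) pairs.
    test_threshold is a hypothetical cut-off, not a value from the paper.
    Returns None when no historical test case is similar enough.
    """
    similar_patches = [
        hist_patch
        for hist_test, hist_patch in history
        if cosine(test_vec, hist_test) >= test_threshold
    ]
    if not similar_patches:
        return None  # no similar failing test case: no prediction possible
    # Average similarity between the candidate and the retrieved patches.
    total = sum(cosine(patch_vec, p) for p in similar_patches)
    return total / len(similar_patches)


def predict_correct(test_vec, patch_vec, history, patch_threshold=0.5):
    """Return True/False for a prediction, or None when no similar test exists."""
    score = score_patch(test_vec, patch_vec, history)
    if score is None:
        return None
    return score >= patch_threshold


# Toy data: two historical (failing test, developer patch) embedding pairs.
history = [
    ([1.0, 0.0], [1.0, 0.0]),
    ([0.0, 1.0], [0.0, 1.0]),
]

likely_correct = predict_correct([1.0, 0.1], [0.9, 0.1], history)
likely_incorrect = predict_correct([1.0, 0.1], [0.0, 1.0], history)
no_prediction = predict_correct([-1.0, 0.0], [0.9, 0.1], history)
```

Returning `None` when nothing similar is found mirrors the limitation stated in the abstract: the approach depends on the availability of similar test cases, so an honest sketch abstains rather than guessing.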


Similar resources

Identifying Patch Correctness in Test-Based Program Repair

Test-based automatic program repair has attracted a lot of attention in recent years. However, the test suites in practice are often too weak to guarantee correctness and existing approaches often generate a large number of incorrect patches. To reduce the number of incorrect patches generated, we propose a novel approach that heuristically determines the correctness of the generated patches. T...


The Role of Task-Based Techniques on the Acquisition of English Language Structures by the Intermediate EFL Students

This study examines the effectiveness of task-based activities in helping students learn English language structures for better communication. Initially, a Michigan test was administered to the two groups of 52 students majoring in English at the Allameh Ghotb-e-Ravandi University to ensure their homogeneity. The students' scores on the grammar part of this test were also regarded as their pre...


The Effect of Genre-Based Teaching on Reading Comprehension of Literary Texts

The present study examines the application of the genre-based approach in a general English teaching context. The genre-based approach helps language learners improve their reading. Some researchers believe that genre-based teaching gradually helps learners become proficient in understanding different genres (Hyland 2004). In addition, increasing attention is now being paid to the importance of using literature in the English language (ESL/EFL) curriculum. When...


The Washback Effect of Iranian School-Leaving Test of English on Students' Learning

The aim of this thesis is to examine the effect of the final English exam of the third year of high school on the learning of Iranian students. The main aim of the study is to examine the test itself and its effect on students' learning. To reach this aim, various research methods were used to collect and analyze the data. The main instruments of the study include interviews with third-year female students (10 students) as well as observation of 6 language classes...

The Effect of Functional/Notional Approach on the Proficiency Level of EFL Learners and Its Evaluation Through Functional Test

This study focused on the following questions: 1. Is there any difference between the effect of the functional/notional approach and the structural approaches to language teaching on the proficiency test of EFL learners? 2. Can a rather innovative language test referred to as "functional test" be devised so as to measure the proficiency test of EFL learners, and thus be as much reliable an...



Journal

Journal title: ACM Transactions on Software Engineering and Methodology

Year: 2022

ISSN: 1049-331X, 1557-7392

DOI: https://doi.org/10.1145/3511096